Segmentation of Overlapped Handwritten Arabic Sub-Words

نویسندگان

  • Hashem Ghaleb
  • P. Nagabhushan
  • Umapada Pal
  • A. Cheung
  • M. Bennamoun
  • S. S. Maddouri
  • V. Märgner
  • N. Ellouze
چکیده

Arabic script is cursive in both handwritten and printed form. Segmentation of Arabic scriptespecially handwrittenis a very challenging task. Many difficulties arise due to the inherent characteristics of Arabic writing such as the overlapping of Arabic sub-words wherein the sub-words share the same vertical space, and vertical ligatures wherein characters are stacked upon each other in a word. In this paper, an algorithm to resolve the overlapping of handwritten Arabic sub-words is introduced. The proposed algorithm is based on pushing strategy;

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Automatic Segmentation for Arabic Character Handwriting

The cursive and ligature nature of the Arabic language make the segmentation of words into individual characters a difficult task. Despite attempts to apply methods for cursive Latin and other languages to Arabic, it is generally insufficient to segment Arabic text. This paper proposes a new segmentation algorithm for handwritten Arabic text and the main idea consist of segmenting the word into...

متن کامل

Arabic Handwritten: Pre-Processing and segmentation

This paper is concerned with pre-processing and segmentation tasks that influence the performance of Optical Character Recognition (OCR) systems and handwritten/printed text recognition. In Arabic, these tasks are adversely effected by the fact that many words are made up of sub-words, with many sub-words there associated one or more diacritics that are not connected to the sub-word’s body; the...

متن کامل

Component-based Segmentation of Words from Handwritten Arabic Text

Efficient preprocessing is very essential for automatic recognition of handwritten documents. In this paper, techniques on segmenting words in handwritten Arabic text are presented. Firstly, connected components (ccs) are extracted, and distances among different components are analyzed. The statistical distribution of this distance is then obtained to determine an optimal threshold for words se...

متن کامل

Segmenting Arabic Handwritten Documents into Text lines and Words

In this paper, we present a method for segmenting Arabic handwritten documents into text lines and words. Text line segmentation is addressed by a well-known technique, the horizontal projection profile, in which autocorrelation is used to enhance the self similarity of this profile. This technique promotes the estimation of text line spacing. Word extraction is based on an adaptation of a know...

متن کامل

A New Arabic (ahd/amsh) Handwritten Database

This paper introduces new database for Arabic handwritten words. The Arabic handwritten database (AHD/AMSH) represents a utility to facilitate the experiments of the character recognition algorithms. It contains three types of images: word, isolated character, and digit images. The AHD/AMSH can be used for baseline detection, characters segmentation, normalization, thinning, training and testin...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2015